-
Notifications
You must be signed in to change notification settings - Fork 25.7k
Add new GHA workflow to cache ROCm CI docker images on MI300 CI runners periodically #148394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148394
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit bd72c96 with merge base 38e81a5 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
1603ad9 to
8d69068
Compare
|
@jeffdaily All lint jobs are clean. This workflow will only be triggered on a schedule. Latest ciflow/rocm-mi300 jobs show the docker pull time decreased as mentioned in PR description. |
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Refiling #148387 from pytorch repo branch to get AWS login via OIDC working
Successful docker caching run: https://github.com/pytorch/pytorch/actions/runs/13843689908/job/38737095535


Run without cached docker image: https://github.com/pytorch/pytorch/actions/runs/13843692637/job/38746033460
Run with cached docker image:
~6 min vs 3 s :)
Thanks @saienduri for the help on the MI300 infra side
cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd